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If the in-put symbols to a commiinicaiion channel represent the outcomes 
of a chance event on which bets arc available at odds consistent with their 
probabilities {i.e., ''fair" odds), a gambler can vse the knowledge given 
him by the received symbols to cause his money to grow exponentially. The 
maximum exponential rate of growth of the gambler's capital is equal to 
the rate of transmissiem of information over the channel. This result is 
generalized to include the case of arbitrary odds. 

Thus we find a situation in which the transmission rate is significant 
even though no coding is contemplated. Previously this quantity was given 
significance only by a theorem, of Shannon's which asserted that, with suit- 
able encoding, binary digits coidd be transmitted over the channel at this 
rate with an arbitrarily small probability of error. 

INTRODTJCTION 

Shannon defines the rate of transmission over a noisy communication 
channel in terms of various probabihties/ This definition is given sig- 
nificance by a theorem which asserts that binary digits may be encoded 
and transmitted over the cliannel at this rate with arbitrarily small 
probalnlity of error. Many workers in the field of communication theoiy 
have felt a desire to attach significance to the rate of transmission in 
cases where no coding was contemplated. Some have even proceeded 
on the assumption that such a significance did, in fact, exist. For ex- 
ample, in systems where no coding was desirable or even possible (such 
as radar), detectors have been designed by the criterion of maximum 
transmission rate or, what is the same thing, minimum equivocation. 
Without further analysis such a procedure is unjustified. 

The problem then remains of attaching a value measure to a communi- 



1 C. E. Shannon, A Mathematical Theory of Communication, B. S.T.J. , 27, 
pp. 379-123, 623-656, Oct., 1948. 
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cation system in which errors are being made at a non-negligible rate, 
i.e., where optimum coding is not being used. In its most general formu- 
lation this problem seems to have but one solution. A cost function must 
be defined on pairs of symbols which tell how bad it is to receive a cer- 
tain symbol when a specified f^ignal is transmitted. Furthermore, this 
cost function must be such that its expected value has significance, i.e., 
a system must be preferable to another if its average cost is less. The 
utility theory of Von Neumann^ shows us one way to obtain such a cost 
function. Generally this cost function would depend on things external 
to the system and not on the probabilities which describe the system, so 
that its average value could not be identified with the rate as defined 
by Shannon. 

The cost function approach is, of course, not limited to studies of com- 
munication systems, but can actually be used to analyze nearly any 
branch of human endeavor. The author believes that it is too general to 
shed any light on the specific problems of communication theory. The 
distinguishing feature of a communication system is that the ultimate 
receiver (thought of here as a pereon) is in a position to profit from any 
knowledge of the input symbols or even from a better estimate of their 
probabilities. A cost function, if it is supposed to apply to a communica- 
tion system, must somehow reflect this feature. The point here is that 
an arbitrary combination of a statistical transducer {i.e., a channel) and 
a cost function does not necessarily constitute a communication system. 
In fact (not knowing the exact definition of a communication system 
on which the above statements are tacitly based) the author would not 
know how to test such an arbitrary combination to see if it were a com- 
munication system. 

What can be done, however, is to take some real-life situation which 
seems to possess the essential features of a communication problem, and 
to analyze it without the introduction of an arbitrary cost function. 
The situation which will be chosen here is one in which a gambler uses 
knowledge of the received symbols of a communication channel in order 
to make profitable bets on the transmitted symbols. 

THE GAMBLER WITH A PRIVATE WIRE 

Let us consider a communication channel which is used to transmit the 
results of a chance situation before those results become common 
knowledge, so that a gambler may still place bets at the original odds. 
Consider first the case of a noiseless binary channel, ^vhich might be 

' Von Neumann and Morgenstein, Theory of Games and Economic Behavior, 
Princeton Univ. Press, 2nd Edition, 1947. 
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used, for example, to transmit the results of a series of basebail games 
lietween two equally matched teams. The gamljler could obtain even 
money bets even though he ah-eady knew the result of each game. The 
amount of money he could make would depend only on how much he 
chose to bet. How much would he bet? Probably all he had since he 
would win with certainty. In thiw case his capital would grow expo- 
nentialiy and after A'^ bets he would have 2'^ times his original bankroll. 
This exponential growth of capital is not uncommon in economics. In 
fact, if the binary digits in the above channel were arriving at the rate 
of one per week, the sequence of bets would have the value of an invest- 
ment paying 100 per cent interest per week compounded weekly. We 
will make use of a quantity G called the exponential rate of growth of 
the gambler's capital, where 

G = Urn - log -~ 

N—a iV Ko 

where V/f is the gambler's capital after A^ bets, T'o is his starting capital, 
and the logarithm is to the base two. In the above example G = 1. 

Consider the case now of a noisy binaiy channel, where each trans- 
mitted symbol has probability, p, or error and q of correct transmission. 
Now the gambler could still bet his entire capital each time, and, in 
fact, this would maximize the expected value of his capital, (Vn), 
which in this case would be given by 

(Vy) = (25) Vo 

This would be little comfort, however, since when A'' was large he would 
probably be broke and, in fact, would be broke with probability one if 
he contiinied indefinitely. Let us, instead, assume that he bets a frac- 
tion, (,, of his capital each time. Then 

T^v - (I +0"'(1 - O'Vo 
where 11' and L are the number of wins and losses in the N bets. Then 

G = Lim 



iV-foS 



]^log(l + 0+|log(l-0] 



^ 5 log (1 + + P log (1 — () with probability one 

Let us maximize G with respect to f. The maximum value with respect 
to the Yi of a quantity of the form Z — ^ X,- log Yi , subject to the 
constraint ^ 1'; = Y, is obtained by putting 

Y =1 X 
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where X = ^Xi. This may be shown directly from the convexity of 
the logarithm. 
Thus we put 

(1 + ^) = 2q 

{!-€) = 2p 

and 

Gmnx = 1 + P log P + 3 log 9 

= R 

which is the rate of transmission as defined by Shannon. 

One might still argue that the gambler should bet all his money 
(make ^ = 1) in order to maximize his expected win after A'' times. It 
is surely true that if the game were to be stopped after A'' bets the answer 
to this question would depend on the relative values (to the gambler) 
of being broke or possessing a fortune. If we compare the fates of two 
gamblers, however, playing a nonterminating game, the one which uses 
the value t found above will, with probability one, eventually get ahead 
and stay ahead of one using any other t At any rate, we will assume 
that the gambler will always bet so as to maximize G. 

THE GENERAL CASE 

Let us now consider the case in which the channel has several input 
symbols, not necessarily equally likely, which represent the outcome of 
chance events. We will use the following notation: 

p(s) the probability that the transmitted symbol is the s'th one. 
p{r/s) the conditional probability that the received symbol is the 

r'th on the hypothesis that the transmitted symbol is the s'th 

one. 
p(8, r) the joint probability of the s'th transmitted and r'th received 

symbol. 
q{r) received symbol probability. 
qls/r) conditional probability of transmitted symbol on hypothesis 

of received symbol. 
a, the odds paid on the occurrence of the s'th transmitted symbol, 

i.e., a, is the number of dollars returned for a one-dollar bet 

(including that one dollar). 
a{s/r) the fraction of the gambler's capital that he decides to bet on 

the occurrence of the s'th transmitted symbol after observing 

the r'th received symbol 
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Only the case of independent transmitted symbols and noise will be 
considered. We will consider first the case of "fair" odds, i.e., 

1 
p(s) 

111 any sort of parimutual betting there is a tendency for the odds to be 
fair (ignoring the "track take"). To see this first note that if there is no 
"track take" 



Ei = i 



since all the money collected is paid out to the winner. Next note that if 

Of, > —7-ir 

pis) 

for some s a bettor could insure a profit by making repeated bets on the 
s outcome. The extra betting which would result would lower a^ . 
The same feedback mechanism probably takes place in more compli- 
cated betting situations, such as stock market speculation. 
There is no loss in generality in assuming that 

Z ais/r) = 1 

i.e., the gambler bets his total capital regardless of the received symbol. 
Since 

he can effectively hold back money by placing canceling bets. Now 

where W,r is the number of times that the transmitted symbol is s and 
the received symbol is r. 

Log tT = X) T^" log «.a(s/r) 

V ^^^ 

G = Lim ^r log -rj? = 52 P(s> log a,a(s/r) 

with probability one. Since 

1 

Pis) 
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here 

= J^p{s,r)\oga{s/r) + H(X) 

where H{X) is the source rate as defined by Shannon. The first t«rm is 
maximized by putting 

ais/r) = /%^ ^pM^ ,(,/,) 
X,p{k, r) 5(r) 

Then G^nx = H{X) - HiX/Y), which is the rate of transmission de- 
fined by Shannon. 

^\TIEN THE ODDS ARK NOT I''AIR 

Consider the case where there is no track take, i.e., 

but where a^ is not necessarily 

1 

V{s) 

It is still permissible to set ^, a(s/r) = 1 since the gambler can effec- 
tively hold back any amount of money by betting it in proportion to 
the 1/a, . Equation (1) now can be written 

G = £ V(^> ^) 'og ais/r) + J^ pis) log a, . 

G is still maximized by placing a{s/r) = q{s/r) and 

a = -H{X/Y) + E K-^O log a. 

= H{a) - H(X/Y) 
where 

Hia) = XI P(«) log «» 

Several interesting facts emerge here 

(a) In this case G is maximized as before by putting a{s/7-) = qis/r). 
That is, the gambler ignores the posted odds in placing his bets! 



/ 
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(b) Since the minimum value of H(a) subject to 

obtains when 

1 

pis) 

and H{X) = H(a), any deviation from fair odds helps the gambler. 

(c) Since the gambler's exponential gain would be H{a) — H{X) if 
he had no inside information, we can interpret H = H(X) — H(X/Y) 
as the increase of Gm^x due to the communication channel. When there 
is no channel, i.e., H(X/Y) = H(X), Gmax is minimized (at zero) by set- 
ting 

1 

as = ~ 

This gives further meaning to the concept "fair odds," 

WHEN THERE IS A "TRACK TAKE" 

In the case there is a "track take" the situation is more complicated. 
It can no longer be aasumed that ^s a{s/T) — 1 . The gambler cannot 
make canceling bets since he loses a percentage to the track. Let br ^ 
1 — 2^« C'(s/r), i.e., the fraction not bet when the received symbol is 
the / one. Then the quantity to be maximized is 

G = 2 pis, r) log [br + aMs/r)], (2) 

ra 

subject to the constraints 

8 

In maximizing (2) it is sufficient to maximize the terms involving a 
particular value of r and to do this separately for each value of r since 
both in (2) and in the associated constraints, terms involving different 
r's are independent. That is, we must maximize terms of the type 

Crr = 9(r)E l(s/r) log [br + aMsM] 
subject to the constraint 
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Actually, each of these terms is the same form as that of the gambler's 
exponential gain where there is no channel 

G = EpCs)log[i> + «.aCs)]. (3) 

We will maximize (3) and interpret the results either as a typical 
term in the general problem or as the total exponential gam in the case 
of no communication channel. Let us designate by X the set of indices, 
s, for which o(s) > 0, and by X' the set for which a{s) = 0. Now at the 
desired maximum 

J^= P(^)«- loge==fc forseX 

da{s) b + a{s)aa 

?? = V_£f$_-loge=/. 

Jff =P(!hMoge<A; for se\' 

where fc is a constant. The equations yield 

1 -p 



ft = log e, h = 



1 - a 



o(s) = p(s) for se\ 

where p = Sx?>(s), ff = X)x (!/«.), and the inequalities yield 

p(s)a, ^ b = for scX' 

1 — ff 

We will see that the conditions 
a < 1 

pis)as > SE\ 

1 — ff 

pis) a, ^ , for seK' 

i — 0" 



completely determine X. 

If we permute indices so that 



p{s)at g pCs + l)a>+i 
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then X must consist of all s g i where t is a positive integer or zero. 
Consider how the fraction 

1 — <Tt 

varies with i, where 

Pi = Zl vis), o-i = Z) - ; 7^0 = 1 

1 1 OLs 

Now if v{l)ar < 1, Ft increases with i until at ^ 1. In this case t = 
satisfies the desired conditions and X is empty. If p(1)q;i > 1 i^( de- 
creases with t until p{i + l)a(+i < i^f or trt ^ 1. If the former occurs, 
i.e., 'p{t + l)a,+i < Ft , then Ft+i > Ft and the fraction increases until 
o"( ^ 1 . In any case tlie desired value of t is the one which, gives Ft its 
minimum positive \'alue, or if there is more than one such value of t, 
the smallest. The maximizing process may be summed up as follows: 
fa) Permute indices so that 'p{s)a, ^ p{s + 1)q;,+i 

(b) Set h equal to the minimum positive value of 

1 where Vt=llv (s), at = ^ — 

i- ~ (Tt 1 10!, 

(c) Set a{s) = p(s) — h/as or zero, whichever is larger. (The a(s) 
will sum to 1 — b.) 

The desired maximum G will then be 



G'mnx = 2Z 7){s) log p(s)a, + (1 — Pi) log 



1 -Pi 

1 - (T, 

where t is the smallest index which gives 

1 -Pi 
1 - ff, 

its minimum positive value. 

It should be noted that if p(s)a, < 1 for all s no bets are placed, but 
if the largest pis)as > 1 some bets might be made for which p(s)ag < 1, 
i.e., the expected gain is negative. This violates the criterion of the 
classical gambler who never bets on such an event. 

CONCLUSION 

The gambler introduced here follows an essentially different criterion 
from the classical gambler. At every bet he maximizes the expected 
value of the logarithm of his capital. The reason has nothing to do with 
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the value function which he attached to his money, but merely with the 
fact that it is the logarithm which is additive in repeated bets and to 
which the law of large numbers applies. Suppose the situation were 
different; for example, suppose the gambler's wife allowed him to bet 
one dollar ea<'h week but not to reinvest his winnings. He should then 
maximize his expectation (expected value of capital) on each bet. He 
would bet all his available capital (one dollar) on the event yielding the 
highest expectation. With probability one he would get ahead of any- 
one dividing his money differently. 

It should be noted that w^e have only shown that our gambler's capital 
will surpass, with probability one, that of any gambler apportioning his 
money differently from ours but still in a fixed way for each received 
symbol, independent of time or past events. Theorems remain to be 
proved showing in what sense, if any, our strategy is superior to others 
involving a(s/r) which are not constant. 

Although the model adopted here is drawn from the real-life situation 
of gambling it is possible that it could apply to certain other economic 
situations. The essential requirements for the validity of the theoiy are 
the possibility of reinvestment of profits and the ability to control or 
vary the amount of money invested or bet in different categories. The 
"channel" of the theory might correspond to a real communication 
channel or simply to the totality of inside information available to 
the investor. 

Let us summarize briefly the results of this paper. If a gambler places 
bets on the input symbol to a communication channel and bets his money 
in the same proportion each time a particular symbol is received his, 
capital will grow (or shrink) exponentially. If the odds are consistent 
with the proliabilities of occurrence of the transmitted symbols (i.e., 
equal to their reciprocals), the maximum value of this exponential rate 
of growth will be equal to the rate of transmission of information. If the 
odds are not fair, i.e., not consistent with the transmitted symbol proba- 
bilities but consistent with some other set of probabilities, the maximum 
exponential rate of gro\\'th will be larger than it would have been with no 
channel by an amount equal to the rate of transmission of information. 
In case there is a "track take" similar results are obtained, but the 
formulae involved are more complex and have less direct information 
theoretic interpretations. 
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